Overview

Dataset Statistics

Number of Variables 15
Number of Rows 11150
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 5562
Duplicate Rows (%) 49.9%
Total Size in Memory 6.9 MB
Average Row Size in Memory 648.1 B
Variable Types
  • Numerical: 5
  • Categorical: 10

Dataset Insights

Dataset has 5562 (49.88%) duplicate rows Duplicates
player has a high cardinality: 4866 distinct values High Cardinality
club has a high cardinality: 332 distinct values High Cardinality
Apps has a high cardinality: 589 distinct values High Cardinality
SpG has a high cardinality: 57 distinct values High Cardinality
AerialsWon has a high cardinality: 74 distinct values High Cardinality
Red has constant length 1 Constant Length

Variables

Rank

numerical

Approximate Distinct Count 770
Approximate Unique (%) 6.9%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.2 KB
Mean 194.3416
Minimum 1
Maximum 770
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Rank is skewed right (γ1 = 1.4447)

Quantile Statistics

Minimum 1
5-th Percentile 21
Q1 87
Median 169
Q3 252
95-th Percentile 496.55
Maximum 770
Range 769
IQR 165

Descriptive Statistics

Mean 194.3416
Standard Deviation 148.524
Variance 22059.3877
Sum 2.1669e+06
Skewness 1.4447
Kurtosis 2.4504
Coefficient of Variation 0.7642
  • Rank is not normally distributed (p-value 8.061449372842471e-14)
  • Rank has 552 outliers

player

categorical

Approximate Distinct Count 4866
Approximate Unique (%) 43.6%
Missing 0
Missing (%) 0.0%
Memory Size 849.4 KB

Length

Mean 13.0046
Standard Deviation 3.3208
Median 13
Minimum 3
Maximum 26

Sample

1st row Hakim Ziyech
2nd row Alireza Jahanbakhs...
3rd row Hirving Lozano
4th row David Neres
5th row Steven Berghuis

Letter

Count 134102
Lowercase Letter 112159
Space Separator 10694
Uppercase Letter 21943
Dash Punctuation 157
Decimal Number 0
  • player contains many words: 5707 words

club

categorical

Approximate Distinct Count 332
Approximate Unique (%) 3.0%
Missing 0
Missing (%) 0.0%
Memory Size 825.8 KB

Length

Mean 10.8408
Standard Deviation 4.3592
Median 10
Minimum 4
Maximum 23

Sample

1st row Ajax
2nd row AZ Alkmaar
3rd row PSV Eindhoven
4th row Ajax
5th row Feyenoord

Letter

Count 113457
Lowercase Letter 93095
Space Separator 6687
Uppercase Letter 20362
Dash Punctuation 112
Decimal Number 378
  • The largest value (fc) is over 3.12 times larger than the second largest value (moscow)

age

numerical

Approximate Distinct Count 25
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.2 KB
Mean 26.8876
Minimum 17
Maximum 41
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • age is skewed right (γ1 = 0.3092)

Quantile Statistics

Minimum 17
5-th Percentile 21
Q1 24
Median 27
Q3 30
95-th Percentile 34
Maximum 41
Range 24
IQR 6

Descriptive Statistics

Mean 26.8876
Standard Deviation 4.1362
Variance 17.1085
Sum 299797
Skewness 0.3092
Kurtosis -0.3203
Coefficient of Variation 0.1538
  • age is not normally distributed (p-value 0.0035003519346910807)
  • age has 18 outliers

Apps

categorical

Approximate Distinct Count 589
Approximate Unique (%) 5.3%
Missing 0
Missing (%) 0.0%
Memory Size 748.2 KB

Length

Mean 3.7138
Standard Deviation 1.6222
Median 4
Minimum 1
Maximum 6

Sample

1st row 34
2nd row 33
3rd row 29
4th row 28(4)
5th row 31

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 26687

Mins

numerical

Approximate Distinct Count 2462
Approximate Unique (%) 22.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.2 KB
Mean 1438.1838
Minimum 15
Maximum 4410
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Mins is skewed right (γ1 = 0.4802)

Quantile Statistics

Minimum 15
5-th Percentile 271
Q1 617
Median 1282
Q3 2169
95-th Percentile 3033
Maximum 4410
Range 4395
IQR 1552

Descriptive Statistics

Mean 1438.1838
Standard Deviation 920.1698
Variance 846712.4141
Sum 1.6036e+07
Skewness 0.4802
Kurtosis -0.7573
Coefficient of Variation 0.6398

Goals

categorical

Approximate Distinct Count 32
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory Size 719.1 KB
  • The largest value (-) is over 2.05 times larger than the second largest value (1)

Length

Mean 1.0376
Standard Deviation 0.1902
Median 1
Minimum 1
Maximum 2

Sample

1st row 9
2nd row 21
3rd row 17
4th row 14
5th row 18

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 4834
Decimal Number 6735
  • The top 2 categories (-, 1) take over 50.0%
  • The largest value (1) is over 1.95 times larger than the second largest value (2)

Assists

categorical

Approximate Distinct Count 17
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 718.7 KB
  • The largest value (-) is over 2.03 times larger than the second largest value (1)

Length

Mean 1.0078
Standard Deviation 0.08799
Median 1
Minimum 1
Maximum 2

Sample

1st row 15
2nd row 12
3rd row 8
4th row 11
5th row 12

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 5127
Decimal Number 6110
  • The top 2 categories (-, 1) take over 50.0%
  • The largest value (1) is over 1.87 times larger than the second largest value (2)

Yel

categorical

Approximate Distinct Count 17
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 718.9 KB

Length

Mean 1.0216
Standard Deviation 0.1454
Median 1
Minimum 1
Maximum 2

Sample

1st row 4
2nd row 3
3rd row 4
4th row 3
5th row 5

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 2153
Decimal Number 9238

Red

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 718.7 KB
  • The largest value (-) is over 7.79 times larger than the second largest value (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row -
2nd row -
3rd row 2
4th row -
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 9788
Decimal Number 1362
  • The top 2 categories (-, 1) take over 50.0%
  • The largest value (1) is over 13.23 times larger than the second largest value (2)
  • Red has words of constant length

SpG

categorical

Approximate Distinct Count 57
Approximate Unique (%) 0.5%
Missing 0
Missing (%) 0.0%
Memory Size 736.8 KB

Length

Mean 2.6633
Standard Deviation 0.7484
Median 3
Minimum 1
Maximum 3

Sample

1st row 4.9
2nd row 4.3
3rd row 3.4
4th row 2.1
5th row 2.9

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 1051
Decimal Number 19372

PS

numerical

Approximate Distinct Count 505
Approximate Unique (%) 4.5%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.2 KB
Mean 76.6633
Minimum 31.8
Maximum 96.1
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • PS is skewed left (γ1 = -1.0165)

Quantile Statistics

Minimum 31.8
5-th Percentile 59
Q1 71.9
Median 78.1
Q3 83
95-th Percentile 89.2
Maximum 96.1
Range 64.3
IQR 11.1

Descriptive Statistics

Mean 76.6633
Standard Deviation 9.2749
Variance 86.0243
Sum 854796
Skewness -1.0165
Kurtosis 1.5201
Coefficient of Variation 0.121
  • PS has 383 outliers

AerialsWon

categorical

Approximate Distinct Count 74
Approximate Unique (%) 0.7%
Missing 0
Missing (%) 0.0%
Memory Size 738.0 KB

Length

Mean 2.7754
Standard Deviation 0.6315
Median 3
Minimum 1
Maximum 3

Sample

1st row 0.2
2nd row 0.7
3rd row 0.6
4th row 0.3
5th row 0.4

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 383
Decimal Number 20665

MotM

categorical

Approximate Distinct Count 16
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 718.7 KB
  • The largest value (-) is over 2.51 times larger than the second largest value (1)

Length

Mean 1.0009
Standard Deviation 0.02994
Median 1
Minimum 1
Maximum 2

Sample

1st row 9
2nd row 14
3rd row 8
4th row 6
5th row 9

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 6634
Decimal Number 4526
  • The top 2 categories (-, 1) take over 50.0%
  • The largest value (1) is over 2.75 times larger than the second largest value (2)

Rating

numerical

Approximate Distinct Count 218
Approximate Unique (%) 2.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 174.2 KB
Mean 6.7719
Minimum 5.82
Maximum 9.11
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Rating is skewed right (γ1 = 0.3629)

Quantile Statistics

Minimum 5.82
5-th Percentile 6.25
Q1 6.56
Median 6.76
Q3 6.98
95-th Percentile 7.32
Maximum 9.11
Range 3.29
IQR 0.42

Descriptive Statistics

Mean 6.7719
Standard Deviation 0.3251
Variance 0.1057
Sum 75506.15
Skewness 0.3629
Kurtosis 1.0268
Coefficient of Variation 0.048
  • Rating is not normally distributed (p-value 0.0017614178494510776)
  • Rating has 130 outliers

Interactions

Correlations

Missing Values